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Given a binary input channel W, let E (p,W) denote "Gallager's Eq" [1, p. 138] evaluated 
for the uniform input distribution: 



E (p,W) = - log 
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00 . In this note we prove that the following relation 

^S: E (p,W-) + E (p,W + )>2E (p,W) (2) 

ta- 
bloids for any binary input discrete memoryless channel (B-DMC) W, and p > 0. The channels 

W~, and W + denote the synthesized channels after the application of the one step polarization 

transformations defined by Ankan [2]. Their transition probabilities are given by: 

W-( yi y 2 | m) = l W (Vi I M i © u 2 )W(y 2 | u 2 ) (3) 

M2€{0,1} 

W + { yi y 2Ul | u 2 ) = -W(yi | Mi © m 2 )W% 2 | u 2 ). (4) 

The special case of the relation above with p = 1 was proved in [2]. Another special case of 
the relation, by first dividing by p and taking the limit as p tends to zero is also shown in [2] 
as a consequence of the chain rule for mutual information. We simply provide the extension of 
these results to arbitrary, non-negative values of p. 



Proof: By Lemmas 1, 2, and 3 proved in the Appendix, we know that 

E (p,W) = -\ogE[g(p,Z)) 
E (p, W~) = -\ogE\g(p, Z 1 Z 2 )} 
E (p,W + ) = -\ogE[h(p,Z 1 ,Z 2 )] 

where Z,Z 1} Z 2 are independent, identically distributed random variables taking values in the 
[0, 1] interval, and 
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h(p,z 1 ,z 2 ) = -(1 + z 1 z 2 )g(p, Zl + Z2 \ + h\- ZlZ2 )gfp^ — ?Ly 
By these identities, showing (2) is equivalent to showing 

E[<7(p, Z 1 )]E[^(p, Z 2 )] > E[g(p, Z 1 Z 2 )]E[h(p, Z U Z 2 )]. 



The proof is carried in two steps. We first claim that the following inequality is satisfied: 

g(p, zi)g(p, z 2 ) > g(p, z x z 2 )h{p, z 1: z 2 ) (5) 
for any zi, z 2 G [0, 1], and p > 0. 

Taking the expectation of both sides in (5) and noting the independence of Z x and Z 2 gives 

E [g(p, Z x )\ E [g(p, Z 2 )\ = E [g(p, Z x )g{p, Z 2 )\ > E [g(p, Z x Z 2 )h{p, Z x , Z 2 )\ . (6) 

By Lemma 5 in the Appendix, the function g(p, Z\Z 2 ) is non-increasing in z±, and z 2 separately 
for any p > 0. Similarly, by Lemma 6 in the Appendix the function h{p,zi,z 2 ) is also non- 
increasing in both z\, and z 2 separately for any p > 0. The monotonicity properties are useful as 
they imply (see, e.g., [3, Ch. 9, p. 446-447]) that the random variables g(p, Z\Z 2 ) and h(p, Z±, Z 2 ) 
are positively correlated. As a result 

E [g(p, Z l )} E [g(p, Z 2 )\ > E [g(p, Z l Z 2 )h{p, Z u Z 2 )\ > E [g(p, Z X Z 2 )\ E [h(p, Z U Z 2 )] , (7) 

concluding the proof of the relation given in (2). 



Now, we prove the claimed inequality in (5). For that purpose, we first apply the transforma- 
tions 

s — , t — arctanh z 1 , w = arctanh z 2 , A; = arctanh (z^) 

1 + p 

where s G [0, 1], and t,w,k e [0, oo). Using these, we obtain 



/l — s , , ,\ cosh(st) " 

g ( tanh t = (8) 

V s J cosh(t) 

/1 — s , , ,\ cosh(sw)« 

g ( ,tanhH = (9) 

V s J cosh(w) 



and 



1 — s , cosh(s/c) s 

— - tanh «)=^r <10) 

A (i^, tanh ( f ), t a^w) = cosh(s(t rw,rt ( f <"> 

V s / 2 cosh(t) cosh(w) 

We further define the transformations 

a = t + w, b = t — w 

, , a + b , a — b . . 

such that t = , and w = where a > \b\. Then, the variable k is given by 

1 , / cosh(a)\ 

T, § ( r~77T" ) • (12) 

2 \cosh(o) / 

Therefore, the expression in (10) becomes 

cosh(a) s + cosh(fr)'' 



9 ( V' tanh(A;) ) = V cosh(q) + cosh(&) — • (13) 
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After a few manipulations on the product of the equations (8), and (9), one can check that 
the LHS of (5) is given by 



cosh(sa)+cosh(sfc) ^ s 



cosh(t) cosh(u>) 

Similarly, using equations (11), and (13), the RHS of (5) is given by 



(14) 



/ cosh(q) s +cosh(b) s \ » 1 1 

\ 2 J cosh(sa)~ + cosh(s6)~ 
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co S h(a)+cosh(fc) ' 2 cosh(f) cosh(w) 



Therefore, we obtain that the inequality (5) is equivalent to 
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Let u = cosh ( 6s ) ° an( j v — cos ^ b \ . Then, by Lemma 5 in the Appendix, whenever a > b > 0, 

cosh(as)- cosh(a) 

we have u > v since 

, cosh(6s)~ cosh(as)« , , . 

cosh(o) cosh(a) 

Moreover, we have u > v whenever a > \b\ by symmetry of the function / s (.) around zero. 



As a result, we have reduced the inequality (5) to the following form: 

F s (u) > F s (v) where u> v. 

But, we know this is true by Lemma 4 in the Appendix. This proves inequality (5) holds as 
claimed. ■ 

Appendix 

Lemma 1: [4] Given a channel W and p > 0, there exist a random variable Z taking values 
in the [0,1] interval such that 

E (p,W) = -\ogE[g(p,Z)} (17) 



where 



I > I 



g(p, Z )=[-(l + Z )— p+ -(l- Z )— p ) . (Ig) 



Proof: Recall E (p, W) = - logj^ 



^W(y | 0)^ + hjV{ y | i)tt? 



Define 



= Vj^, | 0) ± W(y | 1) ^ = (19) 





o) - W(y 


1) 




0) + W% 


1) 



2 

so that W(y | 0) = q(y)[l + A(y)] and W(y \ 1) = q(y)[l - A(y)}. Then, one can define the 
random variable Z = \A(Y)\ £ [0, 1] where Y has the probability distribution q(y), and obtain 
(17) by simple manipulations. ■ 



Lemma 2: Given a channel W and p > 0, let Z 1 and Z 2 be independent copies of the random 
variable Z defined in Lemma 1. Then, 



Eo(p, W-) = -\ogE[g(p,Z 1 Z 2 )} 



(20) 



where g(p, z) is given by (18). 

Proof: From the definition of channel W~ in (3), we can write 



E (p,W-) = -log J2 

2/1,2/2 
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-log ^ 
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((1 + A ( yi )) (1 + A (y 2 )) + (1 - A fa)) (1 - A (y 2 ))) 
+ ((1 - A fa)) (1 + A (y 2 )) + (1 + A fa)) (1 - A fa))) ^ 
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- (1 + A( yi )A(y 2 )) ^ + - (1 - Afa) A(y 2 )) ^ 



where we used (19). We can now define Z\ = |A(Yi)| and Z 2 = |A(F 2 )| where Y 1 and Y 2 are 
independent random variables with distribution q. From this construction, the lemma follows. ■ 
Lemma 3: Given a channel W and p > 0, let Z 1 and Z 2 be as in Lemma 2. Then, 



E (p, W + ) = - log! 



i(i + ^ 2 ) ,(p,A±|) + i (1 -z A) .(„,^|) 
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where y(p, z) is given by (18). 



Proof: From the definition of channel W + in (4), we can write 

E (p,W+) 

= -io g 
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Using (19), we have 

E (p, W + ) 
= -log^ \ q(yi) q{y2) 
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((1 + A( yi )) (1 + A(y 2 ))) 1+ " + ((1 - A( yi )) (1 - A(y 2 ))) ^ 
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= - log f \ ^ (1 + A(yi) A(y 2 )) 
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where g(p, z) is defined in (18). 



Similar to the E (p, W~) case, we define Z x = | A(Yi) | and Z 2 = \A(Y 2 )\ where Y 1 and 
Y 2 are independent random variables with distribution q. However, we should check whether 
this construction is equivalent to the above equation. We note that A(y) G [—1, 1]. When A^) 
and A(y 2 ) are of the same sign, we can easily see (noting that g(p, z) is symmetric about z — 0) 
that 

When A(yi) and A(y 2 ) are of the opposite sign, we note that 

Since we are interested in the sum of the above two parts, we can see that the construction we 
propose is still equivalent. This concludes the proof. ■ 

1 — 3 i 

Lemma 4: For s G [0, 1], define the function F s : [0, 1] — > [1, 2~] as 

F s (x) = (l±pl. (22) 

Then, F s is a non-decreasing function. 

Proof: Taking the derivative of F s (x) with respect to x, we have 

cfe sW ~ x(l + x) 2 " 
since (x s — x) > for x, s G [0, 1]. ■ 

Lemma 5: For s G [0, 1], define the function / s : [0, oo) — > [2~ , 1] as 

/.(*) = (23) 
cosh(A;) 

Then, / s is a non-increasing function. Moreover, this implies the function g(p, z) defined in (18) 
is non-increasing in the variable z G [0, 1] for any fixed p > 0. 



Proof: We can equivalently show that \og(f s (k)) is non-increasing in k. Taking the first 
derivative gives 

( - log(cosh(A;s)) — log(cosh(A;)) J = tanh(sA;) — tanh(/c) < 
ok \s J 

as tanh(-) is increasing in its argument. 

To prove the second monotonicity relation, we let k = arctanhz, and s = j^. Then, 

g(p,u) = / 1 (arctanhz). 

Since k = arctanh z is a monotone increasing transformation, it follows that the function g(p, z) 
is non-increasing in z for fixed values of p. ■ 
Lemma 6: The function h : [0, 00) x [0, 1] x [0, 1] ->■ [2~ p , 1] defined as 

h(p,z 1 ,z 2 ) = j-(l + z 1 z 2 )g (p, Zl + Z2 )+-(!- z x z 2 )g ( p, 



I + Z1Z2J 2 \ l-z 1 z 2/ 

where g(p,z) is given by (18), is non-increasing in the variables z\ and z 2 separately for any 

P > 0. 

Proof: By the symmetry of h with respect to z 1 and z 2 , it suffices to show the claim for 

z 1 alone. In the expression below, we will suppress p in all function arguments, and denote 

g'(u) = -§^g(p, u). Taking the derivative of h with respect to z\, we get 

d , , 1 f Zi + z 2 \ l-z 2 2 ,( Zi + z 2 \ 

T—h{z u z 2 ) = -z 2 g\ — + — — g — 

ozi 2 \l + z 1 z 2 J 2(l + ziz 2 ) \l + z 1 z 2 J 

1 ( zi - z 2 \ 1 - z 2 2 , f Zi - z 2 \ 
2 Z2 Hl-W 2(1-^) 9 \l-z1z2) 
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-z 2 
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l + z l z 2 J \l-z l z 2 
1 - z 2 2 , ( z 1 + z 2 \ 
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+ 2(1 + Zl z 2 ) 9 \l + Zl z 2 ) 
1 — z 2 2 f f Z\ — z 2 \ 



)] 



+ 2(1 - ziz 2 ) 9 \l - z x z 2 ) 

The last two terms that contain g'(-) are negative by Lemma 5, so it suffices to show that 

( zi + z 2 \ ( Zi- z 2 
9 t; < 9 



l + ZiZ 2 J \1 - ZiZ 2 

To that end, observe that, for any z±,z 2 G [0, 1] we have 

z \ + z 2 \z\ — z 2 \ 



1 + Z X Z 2 1 - ZiZ 2 

and by Lemma 5 and the symmetry of g around z — 0, the required inequality follows. 
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